I know I stored it somewhere - Contextual Information and Ranking on our Desktop
نویسندگان
چکیده
1 Motivation Future digital libraries will be distributed, and recent research has already explored some promising approaches focusing on distributed and peer-to-peer search and retrieval architectures, connecting distributed repositories efficiently and transparently. Another aspect, which has been less explored so far, is the role of the implicit personal repositories we all have on our desktops, and the efficiency we expect to gain from storing important resources in these " personal digital libraries ". Ironically, in many cases, moving important resources closer to our workspace results in less retrieval efficiency. Sophisticated web search technology usually allows us to find appropriate documents in a few seconds. Finding these documents on our desktop is surprisingly more difficult, at least if we have been storing documents for a few years or more. This is improving somewhat with the recent crop of desktop search engines, but even with these tools, searching through our (relatively small set of) personal documents with the recent beta of Google Desktop Search is inferior to searching the (rather vast set of) documents on the web with Google. The main reason for this is that one of the distinguishing features of Google-sophisticated ranking using PageRank and other features-is unavailable on our desktop. This position paper explores some of the information we have available on our desktop to extend search beyond simple full-text search, and the algorithms we can build upon this information, implementing efficient ranking of resources on our desktop. Regarding the first aspect, we propose activity-based metadata and relationships as sources of additional information in desktop search. This information in many cases represents contextual information, which is very useful for re-finding resources we already worked with. We will describe corresponding metadata for two selected scenarios. For the second aspect, we will focus on recent advances of PageRank-based ranking, extending models such as the ones describing our contextual information, and show how local and global ranking measures can be integrated in such a model. These two techniques together show considerable promise for extending efficient information access to our desktop, extending globally available information with user-centered activity-based information, and exploiting the unique information background we have available on our desktop. We are currently implementing first prototypes in the context of the open source Beagle project which aims to provide sophisticated desktop search in Linux.
منابع مشابه
Semantically Enhanced Searching and Ranking on the Desktop
Existing desktop search applications, trying to keep up with the rapidly increasing storage capacities of our hard disks, offer an incomplete solution for the information retrieval. In this paper we describe our desktop search prototype, which enhances conventional full-text search with semantics and ranking modules. In this prototype we extract and store activity-based metadata explicitly as R...
متن کاملBeagle++: Semantically Enhanced Searching and Ranking on the Desktop
Existing desktop search applications, trying to keep up with the rapidly increasing storage capacities of our hard disks, offer an incomplete solution for information retrieval. In this paper we describe our Beagle desktop search prototype, which enhances conventional fulltext search with semantics and ranking modules. This prototype extracts and stores activity-based metadata explicitly as RDF...
متن کاملDesktop Search - How Contextual Information Influences Search Results & Rankings
1. MOTIVATION Sophisticated web search technology usually allows us to find appropriate documents in a few seconds. Finding these documents on our desktop is surprisingly more difficult, at least if we have been storing documents for a few years or more. This is improving somewhat with the recent crop of desktop search engines, but even with these tools, searching through our (relatively small ...
متن کاملPeer-Sensitive ObjectRank - Valuing Contextual Information in Social Networks
Building on previous work on how to model contextual information for desktop search and how to implement semantically rich information exchange in social networks, we define a new algorithm, Peer-Sensitive ObjectRank for ranking resources on the desktop. The new algorithm takes into account different trust values for each peer, generalizing previous biasing PageRank algorithms. We investigate i...
متن کاملI Know It When I See It: The Challenges of Addressing Corruption in Health Systems; Comment on “We Need to Talk About Corruption in Health Systems”
In this commentary, I argue that corruption in health systems is a critical and legitimate area for research in order to strengthen health policy goals. This rationale is based partly on citizen demand for more accountable and transparent health systems, along with the fact that the poor and vulnerable suffer the most from the presence of corruption in health systems. W...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005